Understanding Catastrophic Forgetting in Language Models via Implicit Inference
https://arxiv.org/abs/2309.10105
effects of fine-tuning (via methods such as instruction-tuning or reinforcement learning from human feedback)